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We present a search for the Standard Model Higgs boson produced in association with a Z or a W boson, using data 
QQ ^ collected with the CDF II detector at the Tevatron accelerator. A scenario where the Z decays into neutrinos or charged 

1 leptons originating from the W-decay escape detection and the Higgs decays into a bb pair is considered. Therefore 

C " ^ ' the expected signature is large missing transverse energy (I^t)} no isolated leptons, and two b-jets. We present the 

preliminary results in this search using 1.7/fe ^ of data collected by CDF and the work on future improvements to 
^ ^' increase the sensitivity of the analysis. 
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1. INTRODUCTION 



D ' In the Higgs mechanism of the Standard Model, the fermions and weak gauge bosons acquire mass via interaction 
Q_i with the Higgs field which is described by a complex doublet. Three of the four real fields of the doublet couple to 
^ ■ the SU(2) gauge bosons. The observable quantum of the fourth Higgs field is called the Higgs boson. The existence 
of this undiscovered particle is the cornerstone of the Standard Model 



Direct searches performed with the LEP experiments have constrained the Higgs mass to be larger than 114.4 GeV 
J> ' at 95% C.L. 2']. In pp collisions at the Tevatron, the most probable production mode of the Higgs is by gluon fusion 
through a virtual top loop. Around 70% of the Higgs would decay into two &-quarks yielding two &-jets in the final 
] state. Since the QCD 6-quark production is an irreducible background, this analysis would have a low sensitivity. 
' The second most frequent production mode is when a virtual W or Z decays into a W or Z and a Higgs. In this case, 
Q\ • it is possible to trigger on the decay products of the W/Z boson and significantly reduce the QCD background. 

, We are analyzing Z-Higgs and W-Higgs associated productions when the Z decays into two neutrinos, or the W 
] decays leptonically but the charged lepton escapes the detection. Because the neutrinos will not be detected in the 
^ ■ calorimeter, either, they lead to an unbalanced transverse energy sum in the transverse plain (I^t)- 
•i— I , 

5^ , 2. DATA SAMPLE AND EVENT SELECTION 



We use data collected through March 2007, which corresponds to 1.7 /5 ^ integrated luminosity. The events are 
collected by CDF II detector with a trigger that selects events with I^t > 25 GeV at Level 1 at least two Level 2 
clusters with Et > 10 GeV and > 35 GeV at Level 3 . 

In the first step of the analysis, both the Monte Carlo and real data events are to pass a set of quality cuts to 
ensure that the possible beam and detector effects are removed from the data sample making it compatible with the 
simulation. The standard CDF jet clustering algorithm is used 0] with a jet cone of radius 0.4. Jet energies are 
corrected for calorimeter non-uniformity, non-linearity and energy loss in the un-instrumented regions of calorimeter 
and energy coming from different pp interactions during the same bunch crossing. The I^t of the event is then 
corrected with new jet energies. 

The trigger efficiency is obtained from data and is used to scale the signal and Monte-Carlo backgrounds to correct 
for event loss during data taking. The efficiency of the two-jet requirement is 100% if the offfine transverse energy of 
the most energetic jet is above 35 GeV, the second most energetic jet is above 25 GeV, and at least one of the jets 



has I77I < 1.0. The overaU efficiency of the onhne event selection is then parameterized by the offline corrected ifr 
and applied on the Monte Carlo samples providing a proper scaling for the simulated events. 

The final requirement imposed on the data and simulation before comparing them is the 6-tag requirement. We use 
two categories of SECVTX 6-tagging algorithm P|, tight and loose. The main difference between the loose and tight 
tagging algorithms is that the loose tagger has more efficient track selection. The 6-tagging efficiency for tight (loose) 
tagger is 40% (50%) and mistag rate is ^ 2% (4%). In this analysis events are split into two exclusive categories: 
events with 2 jets tagged by SECVTX loose algorithm or exactly one jet tagged by SECVTX tight algorithm. 

2.1. BACKGROUND ESTIMATION 

The backgrounds in this data sample have contributions from the following processes: QCD multi-jet production, 
top quark pair and single production, W or Z boson production with jets and diboson production (WW,WZ,ZZ). 
We simulate processes which yield real taggable objects, that is, when a 6- or a c-quark pair is created. Events with 
light flavor jets with a positive tag are considered to be mistags and are estimated from the data0|. The remaining 
background processes were generated with PYTHIaQ Monte Carlo event generator passed through CDF II detector 
simulation. 

After defining two control regions in the events passing the basic selection criteria, the Standard Model background 
is compared to the data. In the first control region (CRl) all events with identified leptons are vetoed, and the 
azimuthal angular separation between the second leading jet and the I^t is less than 0.4. This control region is 
dominated by QCD multi-jet events. The second control region (CR2) contains events with at least one lepton or 
isolated track and ip{2'"-'^ jet,]^T) > 0.4. This region is sensitive to Electroweak processes, and is used to check the 
overall shapes and normalizations of the Monte Carlo simulated processes. 





Control Region 1 


Control Region 2 


Process 


1 Tight tag 


2 Loose Tags 


1 Tight tag 


2 Loose Tags 


QCD h.f. 


24337.1 ± 111.4 ± 5445.4 


3768.5 ± 45.8 ± 688.2 


50.7 ±5.1 ±12.6 


7.0 ± 2.0 ± 1.9 


Top 


7.1 ±0.4 ±0.8 


2.3 ±0.2 ±0.4 


134.8 ±1.6 ±16.4 


55.9 ± 1.0 ±9.0 


Di-boson 


1.1 ±0.2 ±0.2 


0.1 ± 0.1 ± 0.1 


14.7 ±0.8 ±2.5 


1.9 ± 0.2 ± 0.4 


W + h.f. 


26.2 ± 2.7 ± 11.1 


1.1 ± 0.5 ± 0.4 


80.5 ±4.1 ±34.9 


8.2 ± 1.3 ± 3.6 


Z + h.f. 


8.7 ± 1.2 ± 3.6 


0.9 ±0.4 ±0.4 


17.5 ±1.8 ±7.9 


1.3 ±0.5 ±0.7 


Mistag 


6181.0 ±63.6 ±498.5 


415.2 ± 10.0 ± 71.1 


86.5 ±4.3 ±6.3 


3.7 ± 1.2 ±0.8 


Expected 


30561.2 ± 5469.7 


4188.1 ±693.4 


384.7 ±42.6 


77.9 ± 10.3 


Observed 


29431 


4190 


373 


79 



Table I: Number of expected background and observed events in the control regions 



After achieving a good agreement between the simulation and the data in the control regions (Tabled Figure [T]), a 
set of cuts are selected optimizing the signal Monte Carlo against the background prediction. The optimization yields 
the following selection requirements for signal region (SR): i^(l**jet,^T) > 0.8, Lp{2^'^jet,I^T) > 0.4, ^t/Ht > 0.45^, 
V^jet Et > 60 GeV, I^t > 70 GeV and no isolated leptons in the event. Table HIl shows the comparison between 
expected and observed event yields in the signal region. The dijet invariant mass distributions in the signal region 
are shown in FigO The excess of observed events in double-loose tagged sample has been extensively studied. No 
systematic source for disagreement has been found. The probability of having such an excess as a result of background 
only fluctuation was estimated to be '^3%. 



^Ht is the scalar sum and Ht is the magnitude of the vectorial sum of the px's of the two leading jets 



Process 


Single Tag 


Double Tags 


QCD h.f. 


157.4 ±9.0 ±49.1 


10.6 ±2.4 ±3.9 


Top 


48.2 ± 1.0 ±4.1 


14.0 ±0.5 ±2.1 


Di-boson 


11.5 ±0.6 ±2.4 


1.9 ± 0.2 ± 0.4 


W + h.f. 


59.9 ±4.1 ±26.6 


4.6 ± 1.1 ± 2.0 


Z + h.f. 


28.3 ± 1.9 ± 12.5 


4.1 ±0.7 ±1.9 


Mistag 


98.2 ± 7.3 ± 12.8 


4.7± 1.0± 1.1 


Expected 


403.5 ±60.1 


39.9 ±6.1 


Observed 


443 


51 


{VH niH = 115GeV/c2) 


(1.9) 


(1.2) 



Table II: Number of expected background and signal events in the signal region after applying the final cuts. 



2.2. CROSS-SECTION LIMITS 

Since there is no significant excess in the data compared to the predicted backgrounds, we set 95% C.L. upper limits 
on Higgs boson production cross-section times the branching fraction. The systematic uncertainties are classified 
as correlated and uncorrelated errors considering the relations between the signal and the background processes. 
The correlated errors are taken into account separately for each processes in the limit calculation. The uncorrelated 
systematic uncertainties are: statistical error in negative tag estimate, negative-positive tag rate asymmetry factor, 
QCD multi-jet Monte Carlo normalization (14% in single tagged, 6.3% in double tagged sample), MC statistical 
fluctuations. The correlated systematics are: luminosity (6.0%), ^-tagging efficiency scale factor between data and 
Monte Carlo (4.3% for single and 10.2% for double tags), trigger efficiency (3%), lepton veto efficiency (2%), PDF 
uncertainty (2%) and Jet Energy Scale. ISR/FSR systematic uncertainties (between 1% and 5%) are applied on the 
signal. 

Considering the systematic uncertainties listed above, we computed the expected limit for the Higgs cross-section 
when the Higgs is produced with a Z/W boson and decays to two 6-quarks where Z decays to neutrinos and W to 
leptons. We use Bayesian method for deriving the limits[a|. Table Hill shows the final result. All the cross-sections 
are ratios with respect to the Standard Model cross-section. 
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Figure 1: Dijet invariant mass distributions in control regions for: (left) double-tagged events in CRl, (right) single-tagged 
events in CR2 
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Figure 2: Dijet invariant mass in the Signal Region, single- and double-tagged events 
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Table HI: The predicted and observed cross-section limits of the ZH/WH processes combined when H ~> bb divided by the 
SM cross-section 



3. FUTURE PROGRESS 

Improvements of the analysis technique are being developed to further increase the sensitivity of the analysis. 
Below we summarize the progress in the main directions that are pursued. 

One of the main challenges in this search channel is the modeling of the large QCD multi-jet background. A model 
to estimate this background directly from data has been developed. In order to estimate the multi-jet background in 
the single-tagged sample we measure the probability to tag one jet from the "pretag"^ sample. Similarly, to estimate 
the multi-jet background in the double-tagged sample we measure the probability to tag a jet in a sample that already 
has one jet tagged. This method allows us to estimate the shapes and normalizations for multi-jet production in 



single-tagged and double-tagged categories, Fig 3(a) 



In order to increase the acceptance to the Higgs signal we accept events with three jets, for the first time at CDF 
in this channel. The main motivation is to accept events where one of the b quarks coming from the Higgs radiates a 
gluon. In addition to that, we also accept WH events where the charged lepton coming from the W is reconstructed 
as a jet. The latter case happens when the W decays to an electron and it fails the CDF electron identification 



^i.e. the events passing the basic selection criteria before the tagging requirement 
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Figure 3: Kinematic distributions in control regions 



algorithm, but is reconstructed as a jet; or when the W decays to and r ^hadrons. As can be seen from Fig 3(b) 
our background prediction for jet multiplicity agrees very well with the observed data. 

In order to effectively reduce the large QCD background, we need to get a good estimate of the event true missing 
energy. We do that by calculating the ifr, which is defined as the negative vectorial sum of charged particle pr's. 
For events with true I^t the ifr is highly correlated and parallel with calorimeter I^t, while for QCD events with 
mismeasured jets it is not. As shown in Fig 3(c) the A0(^t,^t) can serve as an excellent kinematic variable to 
discern real from fake I^t ■ 

For searches oi H ^ hh decays it is crucial to precisely meausure the energies of the jets coming from the 6-quarks. 
An algorithm similar to the one used by HI collaboration [tI has been implemented, which combines tracks and 
calorimeter towers to improve the reconstruction of the jets energies. Initial studies show that an improvement 
of ^ 10% in dijet mass resolution can be achieved. To further increase the final discriminant sensitivity, we are 
developing an artificial neural network to maximally separate Higgs boson signal from backgrounds. 

In summary, we have performed a direct search for the Standard Model Higgs boson decaying into 6-jet pairs 
in 1.7 data accumulated in Run II of the CDF detector. We do not observe any significant excess over the 

background predicted by the Standard Model, thus we set a 95 % C.L. upper limit for the Higgs boson at various 
masses. Multiple improvements in the analysis technique are being developed and will be incorporated in the next 
iterations of the analysis. 
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